Dynamic SDN-Based Radio Access Network Slicing With Deep Reinforcement Learning for URLLC and eMBB Services

نویسندگان

چکیده

Radio access network (RAN) slicing is a key technology that enables 5G to support heterogeneous requirements of generic services, namely ultra-reliable low-latency communication (URLLC) and enhanced mobile broadband (eMBB). In this paper, we propose two time-scales RAN mechanism optimize the performance URLLC eMBB services. large time-scale, an SDN controller allocates radio resources gNodeBs according short each gNodeB its available end-users requests, if needed, additional from adjacent gNodeBs. We formulate problem as non-linear binary program prove NP-hardness. Next, for model Markov decision process (MDP), where large-time scale modeled single agent MDP whereas shorter time-scale multi-agent MDP. leverage exponential-weight algorithm exploration exploitation (EXP3) solve single-agent deep Q-learning (DQL) resource allocation. Extensive simulations show our approach efficient under different parameters configuration it outperforms recent benchmark solutions.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Deep Reinforcement Learning for Dynamic Multichannel Access

We consider the problem of dynamic multichannel access in a Wireless Sensor Network (WSN) containing N correlated channels, where the states of these channels follow a joint Markov model. A user at each time slot selects a channel to transmit a packet and receives a reward based on the success or failure of the transmission, which is dictated by the state of the selected channel. The objective ...

متن کامل

Deep Reinforcement Learning for Dynamic Multichannel Access in Wireless Networks

We consider a dynamic multichannel access problem, where multiple correlated channels follow an unknown joint Markov model. A user at each time slot selects a channel to transmit data and receives a reward based on the success or failure of the transmission. The objective is to find a policy that maximizes the expected long-term reward. The problem is formulated as a partially observable Markov...

متن کامل

Radio Access Network Resource Slicing for Flexible Service Execution

Network slicing is a key enabler for the serviceoriented 5G vision, that aims to satisfy various per-service requirements. Unlike core network slicing, radio access network (RAN) slicing is still at its infancy, with several works just starting to investigate the challenges and potentials to enable a mutli-tenant, multi-service RAN. One of the major challenges in RAN slicing is to provide diffe...

متن کامل

A Novel Dynamic Spectrum Access Framework Based on Reinforcement Learning for Cognitive Radio Sensor Networks

Cognitive radio sensor networks are one of the kinds of application where cognitive techniques can be adopted and have many potential applications, challenges and future research trends. According to the research surveys, dynamic spectrum access is an important and necessary technology for future cognitive sensor networks. Traditional methods of dynamic spectrum access are based on spectrum hol...

متن کامل

Operation Scheduling of MGs Based on Deep Reinforcement Learning Algorithm

: In this paper, the operation scheduling of Microgrids (MGs), including Distributed Energy Resources (DERs) and Energy Storage Systems (ESSs), is proposed using a Deep Reinforcement Learning (DRL) based approach. Due to the dynamic characteristic of the problem, it firstly is formulated as a Markov Decision Process (MDP). Next, Deep Deterministic Policy Gradient (DDPG) algorithm is presented t...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Transactions on Network Science and Engineering

سال: 2022

ISSN: ['2334-329X', '2327-4697']

DOI: https://doi.org/10.1109/tnse.2022.3157274